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Abstract 

In this paper 1 we consider the Skorokhod embedding problem for target distri- 
butions with non-zero mean. In the zero-mean case, uniform integrability provides 
a natural restriction on the class of embeddings, but this is no longer suitable when 
the target distribution is not centred. Instead we restrict our class of stopping times 
to those which are minimal, and we find conditions on the stopping times which are 
equivalent to minimality. 

We then apply these results, firstly to the problem of embedding non-centred target 
distributions in Brownian motion, and secondly to embedding general target laws in a 
diffusion. 

We construct an embedding (which reduces to the Azema-Yor embedding in the 
zero-target mean case) which maximises the law of sup s<T B s among the class of min- 
imal embeddings of a general target distribution n in Brownian motion. We then 
construct a minimal embedding of fj, in a diffusion X which maximises the law of 
sup s<T h(X s ) for a general function h. 



1 Introduction 



The S korokhod embedding problem was first proposed, and then solved, by Skorokhodl 
ijlQfiFih . and may be described thus: 

Given a Brownian motion (B t )t>o arL d & centred target law fi can we find a 
stopping time T such that Bt has distribution /x? 
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Skorokhod gave an explicit construction of the stopping time T in terms of independent 
random variables, and in doing so showed that any zero- mean probability measure may be 
embedded in Brownia n motion . Since the problem was p osed many more solutions have been 
given, see for example Dubinsl jlQBsh . IliootJ dl96flh and Chacon and Walsbl lll97fil ). and the 
comprehensive survey article of 




|1969l) and lChacon and Walsbl (|1J_ 
:»16il l)2004fl . With different solutions comes the question 
of optimal properties of the embeddings, and various optimal embeddings have been found 



for example the embedding minimising the variance of T dRostl Il976h . the embeddin; 
minimising in convex order the law of the local time at zero ( VaUojd 1992^ . or the embedding 
stochastically minimising the law of the maximum |Perkina1l98fil) . 

An obvious extension of the problem is to consider more general classes of processes. 
Here the question of the existence of an embedding becomes more interesting. For a 
Markov process and an arbitrary target measure necessary and suffi cient conditions are 
given by Rostl 1 1971 ) and a construction in this general case is given by Bertoin and Le Janl 
1992^1. In the case of di f fusion s on R simpler necessa r y and suffi cient conditions are give n 



Grandits and Falkner ( 200 



Pedersen and Peskirl (|2001) and 



Cox and Hobsor 
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along with some constructions. 

The work we present here was motivated by the following question: 

Given a diffusion (X t )t>o and a target distribution fix for which an embed- 
ding exists, which embedding maximises the law of sup s<T X s (respectively 

SUP S <T W)? 

For Brownia n motion, the question has been solved by Azema and Yoi 1 1979n[l (respectively 
■Tacka 1 1988^ ) under the condition that B t AT is a UI- martingale. 

There are several considerations that need to be made when moving fro m the Brownian 
case to the diffusion case . Firstly, the mean-zero assumption that is made by 



Azema and Yor 



1 1979(1) and Jackal 1 19881) is no longer natural since we are no longer necessarily dealing with 
a martingale. The second aspect that needs to be considered is with what restriction should 
we replace the UI condition? That such a condition is desirable may be seen by considering 
a recurrent diffusion. Here the maximisation problem can easily seen to be degenerate by 
considering first running the diffusion until it hits a level x, allowing it to return to its 
starting point and then using the reader's favourite embedding. Clearly this dominates the 
unmo dified version of the reader's favourite embedding. 

In iPedersen and Peskirl l|200l|) an integrability condition on the maximum (specifically 



that E(sup f<T s(X t )) < oo where s is the scale function of X) was suggested to replace the 
UI condition in the Brownia n case. I n this work we propose using the following class of 
stopping times introduced by Monroe! 1 19721) to provide us with a natural restriction on the 
set of admissible embeddings. 

Definition 1. A stopping time T for the process X is minimal if whenever S < T is a 
stopping time such that X$ and Xt have the same distribution then S — T a.s.. 

The class of minimal stopping times provides us with a natural class of 'good' stop- 
ping times. In the Brownian case it includes as a subclass those embeddings for which 
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E[sup s<T B s ] is finite. Furthermore, there is a link to the uniformly intcgrable case as a 
consequence of the following result: 



Theorem 2. 



Monroe 



191 a. Theorem 3) Let T be an embedding in Brownian motion of 
a centred distribution fj,. Then T is minimal if and only if the process B t/ \T is uniformly 
integrable, or equivalently if¥.[BT\J~s] — Bs for all stopping times S <T. 

Our first result extends this theorem to non-centred target distributions. 

Theorem 3. Let T be an embedding in Brownian motion of a non- centred distribution pL 
with mean m < 0. Then T is minimal if and only if the process B~^ T is uniformly integrable, 
or equivalently if^B^Fs] < Bs for all stopping times S <T. 

It is clear that the notion of a minimality fits well with the problem of embedding in any 
process, such as a diffusion, and not just Brownian motio n. Our approach to embeddings 
in diffusions can be traced back to 



Azema and Yor (19795) and will be to map the diffusion 



into natural scale (so that, up to a time change, it resembles Brownian motion) and use 
techniques developed for embedding Brownian motion. Using this method on a transient 
diffusion one finds that the state space and target distribution for the Brownian motion 
is restricted to a half- line (or sometimes a finite interval). We will show minimality to be 
equivalent to stopping the Brownian motion before it leaves this interval, so that a minimal 
stopping time is necessarily before the first explosion time of X. 

When we map from the problem of embedding fj,x in X to the Brownian motion the 
target law [i we obtain for B is the image of pLx under the scale function. The key point is 
that there is no reason why this target law should have mean zero. Thus, unlike most of the 
other studies of Skorokhod embeddings in Brownian motion we are interested in non-centred 
target distributions, and non-UI stopping times. Instead, as described in Theorem [3] the 
class of 'good' stopping times satisfy slightly different integrability conditions. 

Having characterised the class of minimal embeddings, we then turn to the problem 
of constructing optimal embeddings in Brownian motion for non-centred target distribu- 
tions. In particular, amongst the class of minimal embeddings we find the stopping time 
which maximises the law of the maximum of the stopped process. In the centred case this 
embedding reduces to the classical Azema- Yor embedding. In fact most of the paper will 
concentrate on embedding non-centred target distributions in B, and we will only return to 
the diffusion case in a short final section. 

The paper will proceed as follows. In Section |2 we prove some results concerning min- 
imality of stopping times for non-centred target distributions, giving equivalent conditions 
to minimality in terms of the process. Next, in Section |21 we construct an extension of the 
Azema- Yor embedding for non-centred target distributions and show both that it is mini- 
mal, and that it retains the optimality properties of the original Azema- Yor embedding. In 
Section 0] we use these stopping times to construct an embedding maximising the distribu- 
tion of sup s<T h(B s ) for a general function h. Finally in SectionElwe apply these results to 
the problem of embedding optimally in diffusions. 
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2 Minimal Embeddings for Non-centred Distributions 



In this section we examine the properties of minimal stopping times. In particular, for 
embeddings in Brownian motion we aim to find equivalent conditions to minimality when 
the target distribution is not centred. 

We begin by noting the following result from lMonro which justifies the existence 

of minimal stopping times: 



Proposition 4 ( Monroel ( 1972h . Proposition 2). For any stopping time T there exists 



a minimal stopping time S <T such that B$ ~ Bt- 

For (3 E R define Hp = inf{< > : B t = 0}, the first hitting time of the level (3. The 
main result of this section is the following: 

Theorem 5. Let T be a stopping time of Brownian motion which embeds an integrable 
distribution fi where m = J R x jj,(dx) < 0. Then the following conditions are eguivalent: 

(i) T is minimal for [i; 

(ii) for all stopping times R < S < T . 

E(B s \Fr) < B R a.s.; (1) 

(Hi) for all stopping times S <T, 

E(Bt|^s) < B s a.s.; (2) 

(iv) for all 7 > 

E{B T ;T> H- 7 ) < -7P(T > ff_ 7 ); 

(v) as 7 — *■ 00 

7 P(T > H_ 7 ) 0; 

(vi) the family {Bg } taken over stopping times S <T is uniformly integrable; 

(vii) for all x > 

E(B TAi? J = 0. 

In the case where supp(/i) C [a, 00) for some a < then the above conditions are also 
equivalent to the condition: 

( via ) 

P(T < H a ) = 1. (3) 

Remark 6. (i) Of course the Theorem may be restated in the case where m > by 
considering the process —B t . We will use this observation extensively in Section 
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(ii) Equation J5| is suggestive of the fact that when T is minimal, (■B(u/i-u)at)o<m<i is a 
supermartingale. To check this we need to show also that MB7^ T < oo for all t. We show 
this more generally, for a stopping time S <T. Using 0, 

E{B T ; B T < 0) < E(B T ; B s < 0) < E(B S ] B s < 0), 

so that EBg < EB^ < oo and the process is indeed a supermartingale. 

It follows from comparing parts (ii) and (Hi) of Theorem [5] that if S < T is a stopping 
time and T is minimal, then S is minimal provided EB$ < and E|_Bg| < oo. The first 
condition is a trivial consequence of QJ on taking R — 0, the second condition then follows 
from Remarklfjii) on noting that, since EB$ < 0, we have EBg < EBg < oo. 

Consequently we have the following corollary of Theorem [5] 

Corollary 7. If T is minimal and S < T for a stopping time S then S is minimal for 
C(B S ). 

For ease of exposition we divide the proof of Theorem into a series of smaller results. 
The first is a key result which shows that the strongest of the stopping time conditions 
is sufficient for minimality. Throughout this section it is to be understood that /i is a 
distribution with negative mean and T a stopping time embedding fi. 

Lemma 8. Suppose that for all stopping times R, S with R < S < T we have 

E(B S \T R )<B R a.s.. (4) 

Then T is minimal. 

Proof. Let R < T be a stopping time such that R embeds /z (so that E|.Br| = E|B<r| < oo). 
For a e K, 

sup E(a — Bt\ A) = E(a - B T ; B T < a) 
= E(a - B R ; Br < a) 

< E(a - B T ; B R < a) (5) 

< sup E(a-B T ;A) 

where we use (@J to deduce (JSJ). However since we have equality in the first and last expres- 
sions, we must also have equality throughout and so 

{B T < a} C {Br < a} C {B T < a}. 

Since this holds for all a £ K we must have Bt = Br a.s.. 

It follows that for S with R < S < T, B R = E[B R \T S ] = E[Bt\F s ] < B s , and since 
E[_B5|JF^] < Br we must have Bs = Br = Bt- Thus B is constant on the interval [R,T] 
and R = T . Hence T is minimal. 

□ 
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Lemma 9. If T is a stopping time such that Bt ~ t 1 an d 1^{T > #- 7 ) — > as 7 — > 00 
i/ien E|£?s < 00 and Ei?s < /or a/Z stopping times S <T. 

Proof. We show that, for S < T, EBg < 00 and EBg < EBg from which the result follows. 
Suppose 7 > 0. Since B tA n_ y is a supermartingale, 

E(B T ah_ t ; B s < 0, S < tf_ 7 ) < E(B SA H_., ; B s < 0, S < ff_ 7 ). 

We may rewrite the term on the left of the equation as 

E(B T ; B s <0,T< fl_ 7 ) - 7 P(B 5 < 0, 5" < ff_ 7 < T), 

and by hypothesis 7P(B S < 0, S < ff_ 7 < T) < 7P(ff- 7 < T) -> as 7 -> 00. Further, by 
dominated convergence, E(Bt] Bs < 0, T < i?_ 7 ) — ► E(Bt; < 0) and it follows that 

E(B S ; B s < 0) = lim E(S S ; B s < 0, S < if_ 7 ) > E(B T ; B s < 0). 

7— »00 

Hence EBg < -E(B T ; B s < 0) < EB T < 00. 

Again using the fact that B t ^H^^ is a supermartingale, > E(Bs A if_ 7 ) = E(Bs; S < 
ff_ 7 ) - 7P(i?- 7 < S) so that 

E(B^;5 < ff_ 7 ) < E(Bg;S < ff_ 7 ) + 7 P(H_ 7 < S). 

By monotone convergence the term on the left increases to EB^ , while by monotone conver- 
gence and the hypothesis of the Lemma the right hand side converges to EBg . Consequently 

EBg < EBg < OO. □ 

The previous two lemmas are crucial in determining sufficient conditions for minimality. 
Now we consider necessary conditions. 

Lemma 10. IfT is minimal then, for all 7 < 0, E(Bt — Btah~,) < 0. 

Proof. For 7 < let /( 7 ) = E(B T -B TAH ^) = E(B T - T ,T > # 7 ). Note that /(0) = m < 0. 
It is easy to see that for > 7 > 7' 

P(T e (H 1 ,H J ,))=¥(mi B s e (7', 7)) - 

s<T 

as 7 I 7' or 7' | 7. Since we may write 

/ (7) - /(V) = E(S T ; H 1 <T< H y ) + (7' - 7 )P(T > ffy) - 7 P(ff 7 < T < H Y ) 

it follows from the dominated convergence theorem that / is continuous. As a corollary if 
/(70) > for some 70 < 0, then there exists 71 e (70,0) such that /(71) = 0. 

Given this 71 , and conditional on T > H lx , let T" = T — H 7l , W t = Bh^ 1 +t —71, and 
fi" = C(Wt"). Suppose that T" is not minimal, so there exists S" < T" with law If we 
define 




T on T < H. 

H~, + S" on T > H, 
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then S embeds /i and S < T but S ^ T, contradicting the minimality of T. Hence T" is 
minimal. But then by Theorem [2 WtAT" is uniformly intcgrable and so, for 7 < 71 

K(w T " -( 7 -7i);r" > h™ 7i ) = o 

or equivalently 

/( 7 )=E(B T - 7 ;T>ff 7 ) = 0. 
Hence /( 7 ) < for all 7 6 (-00, 0]. □ 

Lemma 11. Suppose Bt ~ /i awe? to < 0. If E[BtahJ\ = /or all x > t/ien we /lave 
E[_Bt|-7\s] < f or oil stopping times S <T. 

Proof. Fix x > and define T x — T A H x . We begin by showing that T x is minimal for the 
centred probability distribution C{Bt„.)- Fix R < T x . The stopped process -B(t/i-t)AT x is 
a submartingale so E[B T J.F K ] > S fl and E[B fl ] > 0. Thus = E[B T J > E[B R ] > and 
Br = E[J5r x \Fb\- Since B t AT m is UI, by Theorem [21 we have that T x is minimal. 

Now fix S < T and define S x = S A H*. We show E[\B S \] < E[\B T \] < 00. We have 

^l\ B s\I{S<H x }] = ®[ B S x I{S<H*}I{B s >0}] -E [- B S«^{5<J?« 1! }^{B S <0}] 
= E^t^^S^H^I^Bs^O}] - E [^T x ^{S<H :c }^{Bs<0}] 
< E[|B T J/{T<H.}] 

Then, by two applications of Monotone Convergence E[|Bg|] < E[|£?t|] < 00. 

To complete the proof we show that for A e T s , E[B s Ia] > E[B T I A ]. Let A x = A(~\{S < 
H x }. Then E[BsIa] = hm^ E[Bs x IaJ\ = hm^ EIBt^Ia^} by Monotone convergence and the 
minimality of T x respectively Further, 

E[B T J A J > E[B t JaI{s<t<h x} ] E[B T I A \. 

□ 

We now turn to the proof of the main result: 

Proof of Theorem^ We begin by showing the equivalence of conditions (ii) - (v). It is 
clear that (ii) (in) =>■ (iv) => (v). Hence it only remains to show that (v) 

==^> (ii). Suppose (v) holds and choose stopping times R < S < T and A G Tr. Set 
A 1 = AC\ {R < H f }. Since Bt/\H—, is a supermartingale 

E{B SAH ^ ;Ay) < E(B R ^ H _ y ; Ay). (6) 

By Lemma E1E|_Br| < 00 and by dominated convergence the right hand side converges to 
E(Br;A) as 7 — > 00. For the term on the left we consider 

E(S SAff _ 7 ; Ay) = E{B S ; Ad(S < ff_ 7 )) - 7 P(A n (i? < ff_ 7 < 5)). 
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Again by Lemma and dominated convergence the first term on the right converges to 
K(Bs;A) while the other term converges to by (v). Hence on letting 7 — > 00 in Jfjjl we 
have 

E(B S ;A)<E(B R ;A) 

and we have shown (ii). 

We have already shown that minimality is equivalent to these conditions: (ii) ==> (i) 
is Lemma |H1 while (i) =^> (iv) is Lemma ITT)1 

Now consider (vi). If Q holds then Bg < E(Bt\Fs)~ < E{B T \T S ) and uniform 
integrability follows. Conversely, (vi) implies supg <T E(Bg ; Bg > 7) — > as 7 — > 00. 
Taking 5 = #_ 7 A T yields (uj. 

The implication (vii) implies (Hi) is the content of Lemma llll We now show (vi) implies 
(vii). Fix x > 0. Since Bg is uniformly integrable for all S, we have that B^ H AT is 
uniformly integrable. Since B tA H x AT is also bounded above, it follows that Bt^H^AT is UI. 
Hence E[B HxAT ] = lim t E[B tAHxAT ] = 0. 

We have shown equivalence between (i) - (vii). We are left with showing that if \l has 
support bounded below then (viii) is also equivalent. So assume that the target distribution 
fj, has support contained in [a, 00) for some a < and that T is an embedding of /i. Then 
BtAH a is a continuous supermartingale, bounded below and therefore if S < T < H a , 

E{B T \Fs) < B s . 

The reverse implication follows from considering the stopping time H a _ e = inf {t > : B t < 
a - e}, for then if A = : H Q _ e < T} and S = H a ^ e A T, 

(a - e)P(A) = E( J B ffa _ e ; A) = £(,85; A) > E(B T ; A) > aP(A) 

which is only possible if P(A) = 0. □ 

We close this section with a discussion of the case where the mean of the target distri- 
bution is well defined, but infinite. In this case the notion of minimality still makes perfect 
sense, but many of the conditions outlined in Theorem can be shown to be no longer 
equivalent to minimality. 

Suppose that fj, only places mass on K~ and that that 00, x) ) > + ior x < 0. 

Let (f> : M~ 1— ► R~ solve 

fi((-oo,x)) = 

1 + 10 1 {x)\ 

Such a is an increasing function with </>(x) < x. 

Define J = va£{B u ;u < Hi}. By construction 0(J) ~ [i and we can construct an 
embedding of /i by setting T = inf{u > Hi : B u = 4>{J)}. Further, at the stopping time T 
the Brownian motion is at a current minimum. Hence T is minimal. 

Now consider the alternative conditions in Theorem In reverse order, (vii) and (vi) 
both clearly fail, whereas (v) may or may not hold depending on the target law /z, and 
(iv) holds trivially. (However (iv) holds for all embeddings of /i so that it is not sufficient 
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for minimality.) The condition in (m) also holds, since on stopping B is at a minimum 
value. However the choice R = and S = Hi shows that (ii) fails and is not necessary for 
minimality. Hence, in the case where m — J R xfi(dx) is well defined but equal to — oo, the 
only condition from the list in Theorem which could possibly be necessary and sufficient 
for minimality is (Hi). 

3 An extension of the Azema-Yor embedding to the 
non-centred case 

Let jit be a target distribution on K, and let B t be a Brownian motion with Bo = 0. Define 
Mt = sup s<T B s and Jt — inf s <r B s . In this section we are interested in finding an 
embedding to solve the following problem: 

Given a Brownian motion (B t )t>o and an integrable (but possibly not centred) 
target distribution fi with mean m, find a minimal stopping time T such that T 
embeds fj, and 

F(M T > x) 

is maximised over all minimal stopping times T embedding /i, uniformly in x. 

We call an embedding with this property the max-max embedding, and denote it by T max . 

Without some condition on the class of admissible stopping times the problem is clearly 
degenerate — any stopping time may be improved upon by waiting for the first return of the 
process to after hitting level x and then using the original embedding. For this improved 
embedding P(Mt > x) = 1. Further, since no almost surely finite stopping time can satisfy 
P(Mt > x) = 1 for all x > 0, there can be no solution to the problem above in the class of 
all embeddings. As a consequence some restriction on the class of admissible stopping times 
is necessary for us to have a well defined problem. 

Various conditions have been proposed in the literature to restrict the class of stop- 
ping times. In t he case where m = 0, th e condition on T that B t/ \T is a UI martingale 
was suggested by Dubins and Gilatl 1 197sh. and in th is case the maximal embedding is the 



Azema-Yor embedding. When m = M onroe! I|1972|) tells us that minimality and uniform 
integrability are equivalent conditions, so the Azema-Yor st o pping time is the max-max em- 



bedding. For the case where m > 0. iPedersen and P eskir (2001) showed that EMy < oo 



m 

(2( 



is another suitable condition, with the optimal embedding being based on that of Azema 
and Yor. We argue that the class of minimal embeddings is the appropriate class for the 
problem under consideration since minimality is a natural and meaningful condition, which 
makes sense for all m (and which, for m > 0, includes as a subclass those embeddings with 
E(M T ) < oo.) 

We now describe the construction of the candidate max-max stopping time. This con- 
struction is an extension of the classical lAzema and Yorl l)1979c ) stopping time. However 



we derive the stopping rule in a slightly non-standard way via a 



Chacon and Walshl l|l97' 



9 




style a rgument. This i nterpretation of the Azema-Yor construction (in the centred case) is 



Meiliiso 



n 



due to 

In the following we treat the cases m > 0, m — and m < in one go, since the basic 
idea is identical. Define the convex function: 

c{x) :=W\X -x\ + \m\. (7) 

We note that c is related to the potential of /z, that it is Lebesgue-almost everywhere 
differentiable with left-derivative c'_{x) = 1 — 2/i([a;, oo)) and c{x) — \x\ —> \m\ =p m as 
x — > ±oo. For 6 [—1, 1] define 

u{6) := 'mt{y e K : c(y) + 9(x -y)< c(x),Vx G M}, 
_ c( M (fl)) - 

z+w i - e ' 

and (note that z^ 1 is well defined) for x > 

6(x) := u(z^ 1 (x)). 

The intuition behind these definitions is as follows. Let 9 denote a gradient and consider 
the unique tangent to c with gradient 6. Then z + (9) is the ^-coordinate of the point where 
this tangent crosses the line y — x. Similarly, b(x) is the ^-coordinate of the point for which 
the tangent to c at that point passes through (x,x). The mathematical definitions above, 
and the fact that u is left-continuous, ensure that b is well defined. Note that when m = 0, 
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a few lines of calculus are sufficient to check that b 1 is precisely the barycentre function 
which arises in the classical Azema-Yor construction. 

Theorem 12. Let T be a stopping time of (B t )t>o which embeds /i and is minimal. Then 
for x > 

P(M T > x) < \ inf ■ (8) 

2 \<x \ x — A J 

Define the stopping time T max via 

T max := inf{< > : B t < b(M t )}. (9) 
Then T max embeds [i, is minimal, and attains equality in (jSJ) for all x > 0. 

Remark 13. (i) By the comments before the theorem relating b to the barycentre function, 
when m = the above theorem is a restatement of the classical Azema-Yor result, which by 
Theorem|2can be stated in terms of minimal, rather than uniformly integrable embeddings. 
For m > 0, b(x) — — oo for x < m, and consequently T rnax > H m . Moreover, for x > m, 6" 1 
is a shifted version of the barycentre function. Consequently, when m > the embedding 
T m ax may be thought of as 'wait until the process hits m then use th e Azema-Yor embed- 
ding,' and the conclusion of Theorem 1 121 is similar to Proposition 3.1 of 



Pedersen and Peskii 



(2001), except that T rnax is shown to be optimal amongst the larger class of minimal stop- 
ping times rather than the class of embeddings for which Mt G L 1 . However, the truly 
original part of the theorem is in the case m < 0. In that case the embedding 'wait until the 
process hits m and then use the Azema-Yor embedding' does not achieve equality in |JS}. 

(ii) Note that 

c O) - A _ 1 _ x - c(A) 
x — A x — A 

We can relate the right-hand-side of (|10fl to the slope of a line joining (x,x) with (A,c(A)). 
In taking the infimum over A we get a tangent to c and a value for the slope in [—1,1]. Thus 
the bound on the right-hand-side of |(HJ lies in [0, 1]. 

(iii) T max has the property that it maximises the law of Mj over minimal stopping times 
which embed /i. If we want to minimise the law of the minimum, or equivalently we wish 
to maximise the law of —Jt, then we can deduce the form of the optimal stopping time 
by reflecting the problem about 0, or in other words by considering — B. Let T m i n be the 
embedding which arises in this way, so that amongst the class of minimal stopping times 
which embed fx, the stopping time T m i n maximises 

P(-J T > x) 

simultaneously for all x > 0. 

We now turn to the proof of Theorem 1 121 
Proof. The following inequality for x > 0, A < x may be verified on a case by case basis: 

1 



<-{M T >x} 



< 



x — A 



, \B T -\\-{B T + X) 
VtaHx H 7i 



(11) 
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In particular, on {Mt < x}, (fTTf) reduces to 





and on {Mt > x] we get 

Then taking expectations, 

P(M T > x) < 



< 



1 < 



--A 



c-A 



c-B T 
x — X 



1 



A 



E(B' 



T/\H V 



X>B T 
A < B T , 

A > Bt 
A < B T . 

c(A) — \m\ — (m + A) 



(12) 



(13) 



(14) 



If to < then by Theorem [SJfTO,) and the minimality of T we know E(Btah :c ) — and so 

P(M r >*)<±£W^. 

Z x — A 

Conversely if m > 0, by Theorem [5[m) applied to —B, 

m = E(B T )>E{B TAHx ) (15) 

and so 

¥(M T >x)< 



1 



A 



c(A) -2m- X 



l c(A) 
2 x- 



X 



Since A was arbitrary in either case, JSJ must hold. 

It remains to show that T max attains equality in JHJ) , embeds fi and is minimal. We begin 
by showing that it does attain equality in JHJ. Since 



c(A) - A 



= 1 



c(A) 



x—X x—X 

the infimum in JHJ is attained by a value A* with the property that a tangent of c at A* 
intersects the line y = x at (x,x). By the definition of b we can choose A* = b(x). In 
particular, since {M Tmax < x} C {B Tma!C < b(x)} and {M Tmax > x} C {i?T maa; > ^(^)}, the 
stopping time T maa; attains equality almost surely in l|12|) and i|13|l . Assuming that T mQa; 
is minimal, we are then done for m < 0. If to > we do not always have equality in ()15l) . 
If x < to then E(i?T moa; Aff :c ) = x, but then A* = — oo and so the term E(Br ma:c ah x ) — 
m)j{x — A*) in l|14|) is zero. As a result equality is again attained in Otherwise, if 
x > to then T maa; > ff m and the properties of the classical Azema-Yor embedding ensure 
that E(BT max AHx) = m an d there is equality both in l|15|l and ©• 

Fix a value of y which is less than the supremum of the support of fi. Let b^ 1 be the 
left-continuous inverse of b. Then, since we have equality in (JHJ, we deduce: 



>y) 



W(M Tmax > b^iy)) 



c(b(b-\y)))-b-\y) 



b-i(y)-b(b-\y)) 
(l-c'_(2/)) 
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where c'_ is the left-derivative of c. It is easy to see from the definition J7J that this last 
expression equals /i([y, oo)). Hence T max embeds /i. 

Now we consider minimality of T max . It is well known that in the centred case T max 
is uniformly integrable, and hence by Theorem T max is minimal. Suppose m > 0. 
By an analogue of Theorem EJw), in order prove minimality it is sufficient to show that 
lim^foo xP(T max > H x ) — ^ 0. But, by the arguments in the previous paragraph 

2xP(T max >H X ) = 2aP(M Tra „ >x)=x[l- J_{b{x))] 

This last quantity is exactly the height above 0, when it crosses the j/-axis, of the tangent 
to c which passes through (x, x). As x | oo this height decreases to zero. 

Now suppose m < 0. By Theorem Efyj, in order to prove minimality it is sufficient to 
show that as £ j — oo, |x|P(T maa; > H x ) — > 0. We have 

\x\F(T max > H x ) = \x\F(H x < H b - Hx) ) = Jt^itL < 

It is easy to see from the representation of b that b~ 1 (x) tends to zero as x — > — oo. □ 



4 An embedding to maximise the modulus 



Jackal l)1988|) shows how to embed a centred probability distribution in a Brownian motion 
so as to maximise P(sup t<T \B t \ > y). Our goal in this section is to extend this result to 
allow for non-centered target distributions with mean m =/= 0. In fact we solve a slightly more 
general problem. Let He a measurable function; we will construct a stopping time T moc i 
which will maximise P(sup t<T \h(B t )\ > y) simultaneously for all y where the maximum 
is taken over the class of all minimal stopping times which embed fi. The reason for our 
generalisation will become apparent in the application in the next section. 

Without loss of generality we may assume that h is a non-negative function with h(0) = 
and such that for x > both h(x) and h{—x) are increasing. To see this, observe that for 
arbitrary h we can define the function 

h{x) 



maxQ< y < x \h(y)\ - |/i(0)| x > 0; 
max x < y < \h(y)\ - \h(0)\ x<0. 



Then h has the desired properties and since 

sup \h(B s )\ = sup h(B a ) + \h(0)\ 

s<T s<T 

the optimal embedding for h will be an optimal embedding for h. 

So suppose that h has the properties listed above. We want to find an embedding of fx in 
B which is minimal and which maximises the law of sup t<T h(B t ). (Since h is non-negative 
we can drop the modulus signs.) Suppose also for definiteness that \i has a finite, positive 
mean m = J R x[i(dx) > 0. In fact our construction will also be optimal when m — (the 
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Figure 2: c(x) for a distribution /i showing the construction of z + (9q) and Z-{6q). The 
slope of the tangent is 9o where 9q has been chosen such that (assuming h is continuous) 
h{z+{0 o )) = h(-z-(9 )). 



case covered by|Jacka ( 1988J) ) , but in order to avoid having to give special proofs for this 
case we will omit it. 

Recall the definitions of c and u from the previous section. Define 

z+m ._ cwg)-^ (16) 

z _ (e) : _ (17) 

so that — Z($) is the x-coordinate of the point where the tangent to c with slope 9 intersects 
the line y = — x, and set 

6 Q := inf{0 G [-1, 1] : h(z+(0)) > h(-z^(9))}, 

as pictured in Figure |21 Our optimal stopping time will take the following form. Run 
the process until it hits either z+(9o) or — z_(#o), and then embed the restriction of \x to 
[u(9q),oo) or (— oo,u(#o)] respectively (defining the target measures more carefully when 
there is an atom at u(9q)). For the embeddings in the second part, we will use the construc- 
tions described in Section EJ 

To be more precise about the measures we embed in the second step, define 

p:=V(H z+(eo) <H_ z _ (eo) )- : '" i - > 



z + (9 ) + z-{0 y 
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, _ --:->"n i — (#o) _ -. r. 

Let /x+ (respectively /^_) be the measure obtained by conditioning a random variable with 
law fi to lie in the upper p^ 1 (respectively lower (1 — p)^ 1 ) quantile of its distribution. 
Recall that we have taken m > and that 

c (v) = / \ w - v\^{dw) + |m| = 2/ (w - y)fi(dw) + y (18) 
J "'{™>a} 

= 2 / (y - w)n(dw) + 2m~y (19) 

J {w<y} 

Then using l|18|) in the definition 116(1 we have that 

(w - u(6 )) n(dx) + 2pu(6 ) ) 



ji'>«(e )} 



= - / wfi(dw) + u(0 Q ) ( 1 /z((u(0o),°o)) 

Pi{ti)> M (9(,)} V P 

In particular z+(#o) is the mean of since p, + ({u(6o)}) = 1 — ^fi((u(0o), oo)). When we 
repeat the calculation for Z-(0q) using i|19fl we find that 



-z_(fl ) = : / wfx(dw) +u{6 ) ( 1 - — !— /z((-oo, it(0o)) 



1 - P J{w<u(e )} V 1-P / 

Note that — z_(#o) is strictly smaller than the mean of 

We now describe the candidate stopping time T mo rf = T^ od . Note that this stopping 
time will depend implicitly on the function h via z±(0q). Let 

T := inf{< > : B t £ (-Z-{0 o ),z+{O o ))}, 

and define 

Tmtx ° 9t Bt — z + (9o) 
T^ m oO To B To = -z_(e ). 

Here we use 0t o to denote the shift operator, and T&tx is the stopping time constructed 
in Section [3] for a zero-mean target distribution, so that Ti^ ax is a standard Azema-Yor 
embedding of the centred target law p, + . (Recall that z + (0 O ) is the mean of the corresponding- 
part of the target distribution.) Similarly T^ n is the stopping time applied to —B started 
at — z_(#o) which maximises the law of the maximum of —B. In this case the mean of the 
target law is larger than — z-(0 o ) so that in order to define T^~ n we need to use the full 
content of Section [3] for embeddings of non-centred distributions. 

The following theorem asserts that this embedding is indeed an embedding of fx, that it 
is minimal, and that it has the claimed optimality property. 
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Theorem 14. Let fi be a target distribution such that m > 0. Then within the class of 
minimal embeddings of fi in B, the embedding T mo d as defined above has the property that 
it maximises 



sup h(B t ) > x 

t<T 



simultaneously for all x. 



Proof. By construction T mo d embeds fi. We need only show that it is optimal and minimal. 

For x < h(—Z-(9o)) A h(z + (9 )) we know the probability that {sup t<T h(B t ) > x} is 
one and so, for such x, T mo d is clearly optimal. Indeed if h is discontinuous at — z_(6*o) or 
z + (9q) slightly more can be said. Note first that if z+(Qq) coincides with the supremum of the 
support of fi, then by Theoiem\5^iii) and the minimality of T mo d (see below), the stopped 
Brownian motion can never go above z + (6q). Hence we may assume that h is constant on 
the interval to the right of the supremum of its support. 

Define 



L = lim h(y) A lim h(y) 

and take x < L. Then either Bt„ — z+(0o) or Bt = —z-{0q). If Bt = z+(&o) then either 
MT mod > z+(0o) almost surely and 

max h(B t ) > lim h(y) > L 
o<t<T mod yiz+(e ) 

or z + (0q) is the supremum of the support of fj, and 

max MBA = h(B Tn ) > L. 

0<t<T mod 

Similar considerations apply for Bt = —z-(6o) except that then —JT mod > z_(#o) m a ^ 
cases. We deduce that for x < L 



sup h(B t ) > x ] = 1 

t<T mod J 

and hence T moc i is optimal for such x. 

So suppose that x > L. For any stopping time T embedding fj,, the following holds: 



sup h(B s ) > x < P (h(M T ) > x) + P (h(J T ) > x) . (20) 

s<T J 

We will show that the embedding T mo d attains the maximal values of both terms on the 
right hand side, and further that for T mo d the two events on the right hand side are disjoint. 
Hence T mo d is optimal. 

By the definition of 9 , x > (h(z + (6 ))) V (h(-Z-(6 ))). It follows that 

F(h(M Tmod ) >x)= P V(h(M Tmod ) > x\B To = z+{9 a )) 

and by the definition of T mo d and the properties of T max , we deduce 

nHM Tmod ) >x) = p¥{h{M T »J > x\M T&am > z+(6 )) 
= P(/i(M t m ) > x) 
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where here T£ ax is the embedding of Section [31 applied to fi. A similar calculation can 
be done for the minimum. In particular T moc i inherits its optimality property from the 
optimality of its constituent parts Tmt.x and T^ n 

Finally we note that T moc i is indeed minimal. Consider the family of stopping times 
S < T mo d- On the set where Bt = — z-(6*o) we have that Bg is bounded, whereas on the 
set Bt — —Z-(6q) the minimality of T^~ n ensures that Bg is uniformly integrable. Hence, 
combining these two cases, B^ is a uniformly integrable family and the minimality of T mo d 
follows froml^w) applied to a target distribution with positive mean. □ 

Remark 15. If the restrictions of h to R+ and K_ are strictly increasing then T mo d will be 
essentially the unique embedding which attains optimality in Theorem 1 141 If however h has 
intervals of constancy then other embeddings may also maximise the law of sup (<T \h(B t )\. 



5 Embeddings in diffusions 

Our original motivation in considering the embeddings of the previous sections was their use 
in the investigation of the following question: 

Given a regular (time-homogeneous) diffusion (X t ) t >o and a target distribution 
fix , find (if possible) a minimal stopping time which embeds fix and which max- 
imises the law of sup t<T X t (alternatively sup t<T \X t \) among all such stopping 
times. 

Note that in the martingale (or Brownian) case it is natural to consider centred target 
laws, at least in the first instance. However in the non-martingale case this restriction is no 
longer natural, and as we shall see below is completely unrelated to whether it is possible 
to embed the target law in the diffusion X. It was this observation which led us to consider 
the problem of emb edding non-centred distr ibutions in B. 



The key idea fsee lAzema and Yorl ( 19795)) is that we can relate the problem of embedding 
in a diffusion to the case where we are dealing with Brownian motion via the scale function. 
There exists a continuous, increasing function s such that s(X t ) is a local martingale, and 
hence a time-change of Brownian motion. Then the requirement X T x ~ fix translates 
to finding an embedding of a related law in a Brownian motion B, and the criterion of 
maximising sup t<T X t also has an equivalent statement in ter ms of B. 

We first recall the properties of the scale function (see e.g. Rogers and Williamsl l|200(i 
V.45)). If (X t )t>o is a regular (time-homogeneous) diffusion on an interval ICR with 
absorbing or inaccessible endpoints and vanishing at zero, then there exists a continuous, 
strictly increasing scale function s : I — > K such that Y t = s(X t ) is a diffusion in natural 
scale on s(I). We may also choose s such that s(0) = 0. In particular Y t is (up to exit 
from the interior of s(I)) a time change of a Brownian motion with strictly positive speed 
measure. For defmitcness we write Y t = B Tt . 
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We suppose also that our target distribution fix is concentrated on the interior of /. 
Then we may define a measure fi = fiy on s(J)° by: 



H(A) = fiy(A) = fXxis-^A)), A e s(I)°, Borel 

The original problem of embedding fix in X is equivalent to the problem of embedding fiy 
in Y before Y exits the interval s(I)°. Since Y is a time change of a Brownian motion, we 
need only consider the problem of embedding fiy in a Brownian motion before exit from 
s(I)°. If T is an embedding of fi in B then T x = t _1 (T) is simultaneously an embedding 
of fiy in Y and /ix in X. 

The first question is when d oes any embedding exist? I f we 



then the following lemma (sec Pcd ersen and Peskirl 1)200 111 and ICox and Hobsonl (J2004)) 



d efine m = fj s(x) p, x(dx ) , 
1 ICox and Hobsonl hooh 



gives us necessary and sufficient conditions for an embedding to exist. 

Lemma 16. There are three different cases: 

(i) s(J)° = R, in which case X is recurrent and we can embed any distribution fix on 1° 
in X . 

(ii) s(I)° = (—oo,a) (respectively (—a,oo)) for some a > 0. Then we may embed fix in 
X if and only if m exists and m > (resp. m < 0). 

(Hi) s(I)° — (a, (3), a < < (3. Then we may embed fix in X if and only if m = 0. 

In each case it is clear that: 

T x is minimal for X T x is minimal for Y T is minimal for B, 

where T = t(T x ). Further, since fi = fiy is concentrated on s(J)°, if the stopping time T 
is minimal then T will occur before the Brownian motion leaves s(T)° (this is a consequence 
of Theorem0in case (ii) and Theorem[5]in case (Hi)), and then T x will be less than the 
first explosion time of X . 

It is now possible to apply the results of previous sections to deduce a series of corollaries 
about embeddings of fix in X. Suppose that fix can be embedded in X or equivalently that 
fiy can be embedded in B, before the Brownian motion leaves s(I)°. Suppose further that 
in the recurrent case where s(I)° = R the law fi — fiy is integrable. Let T max and T^ od 
be the optimal embeddings of fi in B as defined in Sections and (Observe that from 
now on we make the dependence of T^ od on h explicit in the notation.) Then we can define 
T*ax and T*^ by 

rpX — 1 rp rriX,h — 1 rph 

1 max^ T ° 1 max 1 mod^ T 1 mod- 

Corollary 17. T x ax is optimal in the class of minimal embeddings of fix in X in the sense 
that it maximises 



maxXt > y 

\t<T 



uniformly in y > 0. 
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Corollary 18. is optimal in the class of minimal embeddings of [ix in X in the sense 

that it maximises 



m&x(/i o s)(X t ) > y 
uniformly in y > 0. 

Corollary 19. ' is optimal in the class of minimal embeddings of fj,x in X in the 

sense that it maximises 



(max|Y t |> y ) 



uniformly in y > 0. 
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